SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: CATCHESIDE; DAVID E. 
(ii) TITLE OF INVENTION: REAGENTS AND METHODS FOR DIVERSIFICATION OF DNA 

(iii) NUMBER OF SEQUENCES: 2 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant, Gould, Smith, Edell, Welter & Schmidt 

(B) STREET: 3100 Norwest Center, 90 South 7th Street 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: USA 

(F) ZIP: 55402 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/751,962 

(B) FILING DATE: 29-Dec-2000 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/977,171 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Skoog, Mark T 

(B) REGISTRATION NUMBER: 40,178 

(C) REFERENCE/DOCKET NUMBER: 10552.13US01 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-332-5300 

(B) TELEFAX: 612-332-9081 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GATCGCAACT GGAGATCACT CGCACCGTGC CGCAGAACAA GGGCGACGAG CCTCAGGGCA 60 
GTTTAGCCTG CCGTAACAGC ACAGACCATA GCTTATTTTC ACCTGGGCGG GCGGGCGACG 120 
63 GCGGCACTGA CATCGGCAAG GCGGCATCAA GCAACCCCTC TGTTGCTTGC CAGCTGCCGG 180 
64 rCAACGTCAG CGGTACAAGG AGAAATCTGG AAGGAAAGAC TTCTGGCACC GACAGGATGG 240 
65 CACGCGGGAA AAGTTCCCAA TGCATGAGAT GAGGGGCATT TGCATTGCCT CCCGTCACAC 300 
66 TGCCCGCGAA CCCCAACCCC ACCATAGCGT CTGTCGATAC ATGGAGCGCG AAGTCGAGAA 360 
67 ACCTGTAATT CCTGGTAACT TTCAGGTACA CAGTACGTAC TGATCCTGGT ATCAAACCTT 420 
68 GCCTGCCGAG TTTTCGACGG AAAGAGGTGT GAATTGTGAA AGAGTCATAC CAAATCACCC 480 
69 GATTTTCATA AAGCCCGAGT CTTTTCTGTA CATAAGCGAC ACTCGAAGCG GGCCTCATCT 540 
70 TCATAGCCTG ATAGCTTGTA ATACTCCATC CTCGTATCTC ACTTGACCTT GAGTTCAACC 600 
71 rrACGTCAGA CTTCACrCGA CACATCGACG GATTGGGGAA CAGCACAATA CCTGAAAAGC 660 
72 GAGAAAACCA AACAGAGGAA AACACCATGG AGACAACACT TCCCCTCCCC TTCCTCGTCG 720 
73 GTGTPAGTGT TrrTcrcGGA TTGAATGACA TCAAGGAGGG CCTCAGCCGG GAGGAAGTCT 780 
74 CGTGTCTTGG CTGCGTCTTC TTCGAGGTCA AGCCCAAGAC CCTTGAGAAA ATCGTGCGAT 840 
75 TCCTCAAGCG TCACAATGTC GAATTTGAGC CCTACTTCGA TGTAACAGCC CTCGAGTCTA 900 
76 TPRATflATAT [illegible] CTGGAPGrCG GCGCCCGCAA GGTGTTTGTC AAGACCGAGC 960 
77 [illegible] CCTCTCCGCA TATGGPTrrC GCGTTGrrcr CATTGTCACT GGAAGCAGCG 1020 
78 [illegible] TTCCTCCGCC ACCGAGAGCG GCCTTTTGCT CTCCGGCTTC GATCAGACTG 1080 
79 [illegible] TGPArAGTTT CTGGAGGAGG CCAGAGACAA GAAAATTACC CCCTTCTTCA 1140 
80 [illegible] TCrTGGGGCr GATPTrGAAC AGTTCATCCA GGTCGCCGCC AAGGCTAACG 1200 
81 [illegible] rTTGrTATfr ACTGGrTTGA CAACAAAGAA GGACGAGGCC GGAAAGCTTG 1260 
82 [illegible] rATrrTCTCG AGCGTCTGGA AGTCTGACCG TCCCGATGGT CTGCTCCCCA 1320 
83 rPGTTGTCGT TGATGAGPAT GArArTGCTC TGGGTCTGGT CTACAGCAGT GCCGAGAGTG 1380 
84 TGAAPGAGGP rrTrAnfiAfA rAfJArTGGTG TCTATCAGAG CCGGAAGCGC GGTCTCTGGT 1440 
85 APAAGGGTGC TACTTCCGGA GACACTCAGG AGCTCGTCCG CATCTCGCTT GACTGCGATA 1500 
86 ACGATGrTCT CAAGTTTGTC GTGAAGCAGA AGGGTCGTTT CTGCCACCTC GATCAGTCCG 1560 
87 [illegible] [illegible] [illegible] iGr'TPGAGr'A [illegible] TPGAGGAAAr 1620 
88 AGTCTGCCCC CGAGGGCTCC TACACTGCCC GTCTCTTCTC CGATGAGAAG CTAGTCCGGG 1680 
89 CCAAGATCAT GGAGGAGGCT GAGGAGCTCT GCACCGCTCA GACCCCCCAG GAAATCGCCT 1740 
90 TTGAGGCTGC CGATCTCTTC TACTTTGCTC TTACCAGGGC CGTTGCTGCC GGCGTTACTC 1800 
91 TTGCCGATAT CGAAAGGAGC CTTGACGCCA AGAGCTGGAA GGTCAAGCGC AGGACTGGAG 1860 
92 ATGCTAAGGG TAAGTGGGCT GAGAAGGAGG GCATCAAGCC TGCGGGGTGG GCTCCCGCTG 1920 
93 CCACTTCGGC CCCTGTCACC AAGGAGGCCG CCCAGGAGAC CACCCCTGAG AAGATCACCA 1980 
94 TGAGACGTTT CGACGCCTCC AAGGTCTCTA CCGAGGAGCT CGATGCTGCT CTCAAGCGTC 2040 
95 CTGCGCAAAA GTCGTCCGAT GCCATCTACA AGATCATTGT CCCCATCATC GAGGACGTCC 2100 
96 GCAAGAACGG CGACAAGGCT GTTCTGTCGT ACACTCACAA GTTCGAGAAG GCTACCTCTC 2160 
97 TTACTAGCCC CGTCCTGAAG GCGCCCTTCC CCAAGGAGCT TATGCAGCTC CCTGAGGAGA 2220 
98 CCATTGCTGC CATCGACGTG TCCTTCGAGA ACATCCGCAA GTTCCACGCC GCCCAGAAGG 2280 
99 AGGAGAAGCC CCTCCAGGTC GAGACCATGC CCGGTGTTGT CTGCAGCCGT TTCTCTCGTC 2340 



100 CCATCGAGGC CGTCGGCTGC TACATCCCCG GCGGTACCGC CGTTCTCCCC AGCACTGCCC 2400 

101 TTATGCTGGG TGTTCCCGCC ATGGTCGCCG GCTGCAACAA GATTGTGTTC GCCTCTCCTC 2460 

102 CCCGCGCCGA CGGAACCATC ACTCCCGAGA TTGTCCACGT CGCTCACAAG GTTGGGGCCG 2520 

103 AGTCCATCGT GCTTGCCGGC GGTGCCCAGG CCGTAGCTGC CATGGCCTAC GGCACCGAGA 2580 

104 GCATCACCAA GGTCGACAAG ATTCTCGGCC CCGGTAACCA GTTCGTCACT GCTGCCAAGA 2640 

105 TGTTCGTCAG CAACGACACC AACGCTGCCG TTGGGATTGA CATGCCCGCT GGCCCGTCCG 2700 

106 AGGTGCTGGT CATCGCTGAC AAGGACGCCA ACCCCGCGTT CGTTGCCTCG GATCTCCTGT 2760 

107 CCCAGGCTGA GCACGGCGTT GACAGTCAGG TCATCCTGAT CGCTATTAAC CTCGACGAGG 2820 
108 AGCATCTTCA GGCTATTGAG GACGAGGTTC ACCGTCAGGC TATGGAGCTT CCTCGCGTCC 2880 

109 AGATTGTCCG TGGCTCCATC GCCCACTCGA TCACCGTGCA GGTCAAGACC GTCGAGGAGG 2940 

110 CCATGGAGCT CAGCAACAAG TACGCTCCTG AGCACTTGAT CCTCCAGATC AAGGAGGCCG 3000 

111 AGAAAGCTGT CGATCTTGTC ATGAACGCTG GTAGTGTCTT CATTGGCGCT TGGACTCCTG 3060 
112 AGTCCGTTGG CGATTACTCT GCTGGTGTTA ACCACTCGCT GCGTAAGTTA CATATCATAA 3120 
113 ATAGCCCCGC TTCACAGATT CTTCTGCTAA CGTCAAGACA CATAGCTACC TATGGTTTTG 3180 
114 GCAAGCAGTA CTCTGGCGTC AATCTCGCCT CGTTCGTCAA GCACATTACC AGCTCCAACT 3240 
115 TGACTGCCGA GGGTCTCAAA AACGTCGGCC AGGCTGTCAT GCAGTTGGCT AAGGTTGAGG 3300 
116 AGCTCGAGGC TCACAGAAGG GCGGTCAGCA TCCGTCTTGA GCACATGAGC AAGAGCAACT 3360 
117 AGACGGAAAT TCTTTTTCGA AGTTGCAAAA AAAACAAGAA CAAAAGGATG TAGTGGGTTG 3420 
118 ATGTATATCT GGGTCATTTT GGGCACATAG AGTAATGATA ACGAGTTTTG GACATTGTAC 3480 
119 TGTTCTGTAC AGGCTGAAGA TCAGTACATG AATCTGTTGG TAAGTGTAGA GACCCAAACG 3540 
120 TCCCTTGAGT TTTTCTCCCT GTTCCAGAGA GGTGCTCGTC CCTGGGTGTT TATTTTCATT 3600 
121 ATTACATCAA CCTTTTATTT TATTTTATTT TTTATTTTAC TTTTTTTTCC TTTTTTTCAG 3660 
122 ATCATGCGTA CATGAACGGG GGAAGCACAG ACGATCGAAA CGTGGATGTC ACAATGTCGC 3720 
123 TGCAGTGATG CTGCATTGCA TGAAGCGCCC ATCTCAATAT ACTTGCAGTC TTGCGCGTTG 3780 
124 CACGTGAACT TCCCAAACAA CCGAATAAAA GACGGCGAAA AATGAAGATA AAAAAAAACC 3840 
125 ATAATAAAAA TCGGAGGGAG TGTGGGAAAT GGTTTCTTTT AGCATTTAGA CCCCATAGCC 3900 
126 GTGCACGCCC GGGTACAGAC AGGTTCATCG ATGTTGACAT TGACTGGGAC ACCAGGTCTA 3960 
127 TGTATTTCAT CTCCTGTCCT CTACCATACA TCGGGACATC GGACATCTCG CTGTACCCCC 4020 
128 CACACCCACA AAGTCTTATA AAAGCGCCAC ACCCGAGGAG GTTCGGTCGG CCCCACGAAC 4080 
129 TCCGTGCCTC CCTGGGTGTT TACAGGGACC GAACGCTGGA GAAGCTTAGT TTCCTGACAT 4140 
130 CCGGCCTACC CGAGCAGGAA AAGGGACAGC TCATAGGCGA GGAGGGATTT GAAGATGGGG 4200 
131 ACATTTTGGA TGATTCGAGA GGAGGAACTA GGTACTGTAT CATGATAGTT CGGGGCAGCA 4260 
132 TCTTGGCTGG GACATTGTTA ATACCTCGAT ATGATGAAGT GGGAGGGAGT TTTTTCATGT 4320 
133 CTTGCCCAAG TCCCACTAAT CTTTTTTTTT TTTTGTACCA ACACCCAAGA TTCGGAGAAT 4380 
134 AGTGTAAGGA TTCGCATTCA CAAGTGGAAG TCTGAGGATC TTTTTATATC TTTGTCTTCC 4440 
135 GCGGACTGTT AACGATCCTA CAGCGAGCGA GCGAGCGGTC GGATGCGCTG ATCTGATAGG 4500 
136 TGCAATATAC GGCCGCTTTC TCCGGTCGTG TAGTGTAAGC TCTGTCGGCA TAGTAGTACA 4560 
137 CTAAAAAAAC CCTTGCATTT CATGATCTGC TTGCTATTCA TTCCGAGTTA TTTCAGTGGT 4620 
138 CACATTTCGA GATTCACAGC CATCCATCCA TATGGAAAAA TCCATTCCCA TGCTTCCTCC 4680 
139 CCCCCACTAT GTATGTGACC ACACGCTGCT GTCAGAATGC CAACGGTCTC AGGTACCCTC 4740 
140 GTCCGACTGT TTGGCATGGA GTTACATACA CTACTAGTGT AGCCCCGGGC CAAGCTACCC 4800 
141 CGTCAAATCT ATACATATCT ATAATGGGTT TCAGGTGTTT CGTTCGCTGT CAATCAAGTT 4860 
142 TGAAACATCA CTGGGGCCGT TGGACGGTGT ATTAGACCAT TGGCTCCCTC AGCTGGCGGC 4920 
143 TGGGCGGTTG GGTCGGCAAT AACGGGACTG GACTTGAGAG GGACGAGGAG AGTCGGTTGG 4980 
144 CTGCCTACAC TACACTACAA GCGTTCCCAC CTAACCGACG AGTCCCGTTT TCCATTTGTG 5040 
145 TGCCTTAACC ATCATCTAGG GATGTCAGGG TTTGGCCGGA TCAGGGTATG TTTGGTTGAC 5100 
146 TGTTGTCATG TCTGATTGGG TACATATCAT GGTAGGTGTC TCGAGAACAG TAGAGTACTC 5160 
147 GGGCCTAGCG TTTGGATGAT TACGCGAGAT ATGAGTTGTA GGCCGCCATG CAGTTGCTTG 5220 
148 CCCATAAGCA GAAGTTGCTT TGGGATATAT TTCTCGTCTT TCAAAGGTCA CGAGGTCCTG 5280 
149 GGACGAGCGG CATCGCCATC CAAAGGGTTG AACATGAGAA ACCGGAATGG CCTTTGCGTT 5340 
150 GAAATACAAA AAGTCAAGAA TAAAATCGCT TGAGGATAGG GACGTGGAAG CAAGCAAATA 5400 
151 TGGTAAGGGA GGTACTGCTA TGTAGGTGCT CAGCAAACTG CCAATTTCTT GGCCCCCAAG 5460 
152 CAGCAGTTTG CTGTCAGTGC TGCTCGTGTC AGCCTTGGTA GTGGAACCTA AACTGCTAAC 5520 
153 ACAGCGCAAG TGCGCATGTA AAGATATTGT GGGAGGATCT GTATGGATGG ATGAGATTAC 5580 
154 TGCTTGGTGT TGGTTGCGAG GCACTGCGGC TGTTAGGCTT TGCTGTGCCC CGTTCGACGA 5640 
155 AGAAATACGC GGAACTATAA ATTGGATACC TAGACTTACT GCCTATGGGA GGTATCTACC 5700 
156 GACGTAGCCG ACGGATTCTA GCAACATCCC GACTTTGCTT GTAGTGTACT ATGATAGCAG 5760 
157 CACAGTGGGG TGTTGCTCCT TGTGAGCATG GGCTCTTTTT TTTTTTTTCC CCCTTCCCTA 5820 
158 GGGCGTTGAC TGGACTTGCT CTATCGTTCC CAAGGTAGGT GCCCGTCATC GATTTTCCCA 5880 
159 AGCCGTCTCC CGCCAGATTG TCGTCATAGT GTCATGATGA CCTCGGTCGC TGGGGCTGCG 5940 
160 TGGTTACGGG GAGCTGGGAC CGCTAGGCCT CAGTGGTTGT GCCATTCAGC GTGGGTGTGT 6000 
161 GGAGTAGCGG TAGAGGCGCT TGGAAGTTGT GCTAGCGGAA ACCCTGGAAT ATCTTGTACC 6060 
162 CTTCGATTCC TTCTCGGGCT GCCCATGTGC TGAGGTGATG CCGGGGATCT GGCGCCAATC 6120 
163 ATCCATTGAG GTTCCCGCAG CTTCCCGGTG CCGCGCGCGG GCGCAGTTGC TCACAGGACA 6180 
164 CACCTAGACG CAGGGGCACA GGGGCACCGT TTGGTGTGCA ACTGGGTACC TGGTAGCTGT 6240 
165 AGCAAGCACT CCACCGTCTG TGCAATCCCC CAATCCACGG CAGGAACTTA GCACCGCCGC 6300 
166 GGCACCGAGT GAGCGAATCC ATCCGCATTG GATCCCAATT CTTGCCCTTG CCATCCTTCT 6360 
167 TTCTTCCCAC TTGGCGCAAC CAACACTTCC CTTGGTCTGG GTACTCGTGT TGATCTTCAC 6420 
168 TCTCTTTTTT TCTTGGGCGA CCGACTTTTT ATATCCGTCC TTGCTTCCCC CTGGCCGTTG 6480 
169 TCGTTCTTTC TACAACTACC TTCCGTTCAT TATCCCCTTT CTTGGTTCGG TCGAGGACCC 6540 
170 AAAAACAGAA CAATTCCGGC TCTTCCAGGT GGCTTGGGTG CGACTGTTTA GCTCTTGACC 6600 
171 ACTAGCCGCT TACCTTCTCT TGATGTTTAT ATTTGGATAT CATTGAACTA CTCTTTCTTG 6660 
172 AAACGGCAGA CGAACGGAAC AGTCCCTACG GTTTATTAGC GATATACGTT GTACTGATAT 6720 
173 CCTGAGCAAG AAGAGGCAAA TTATCAATTA TGCATCTCCC ATCGTCGCTG CTCATCGCAG 6780 
174 CTCCCTTGCT CGCCAATGTA TCGGCCGAAC CGATTAGGAT ACCCCAACGC GATGTTCTCC 6840 
175 GTGGTATCAA CATCACAGCA ACTTGCCGTT CGAGCACTAC CGAATTCGCC CAGCGGTGGA 6900 
176 TATGCCCCTG CCGTTGTAGA CTGTCCCAAG ACCAAGCCGA CGCTCCGGAA GGCCGTGGAT 6960 
177 TTGTCGAACG AGGAGAAGAA CTGGTTGTCG ATCCGGAGGA AGAACACCAT CCAGCCCATG 7020 
178 AGGGACCTAC TGAAGAGGGC CAACATCACT GGGTTCGATT CCGAAACTTT CATGAATGAG 7080 
179 GCCGCCAACA ACGTCTCGCA ACTGCCCAAT GTCGCCATTG CCATTTCAGG AGGCGGCTAT 7140 
180 CGTGCCCTCA TGAACGGCGC CGGCTTCGTT GCTGCTGCGG ATAACCGGAT TCAAAATACC 7200 
181 ACGGGCGCAG GTGGTATTGG AGGCTTGTTG CAGTCCAGCA CATATTTGTA TGTA/y^ACCA 7260 
182 TGCCTTCTTG TGGTTCTTCT TATCTCGTTT TCGAGTGTCA ACTGCGCCAG TTCGACGTTG 7320 
183 GGCGGCTGTG GACGACCTTG CTGGTGAACA TGTCTTGGAC TCCATGCCCC TTTTTTCGTT 7380 
184 CCCTAAAATC CCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAATTCGAG 7440 
185 GACCGTGACT GTAAATTGCT AACGCAACTC TAGGGCCGGA CTTTCTGGTG GTGGCTGGCT 7500 
186 TGTCGGCAGT TTGTTCTCCA ACAACTTCAG TAGCATTGAG ACCCTGCTGA GCGAGAACAA 7560 
187 AGTCTGGGAC TTTGAGAACT CCATCTTTAA AGGACCCAAG GAGGCTGGCC TTAGTACTGT 7620 
188 CAACCGTATC CAGTACTGGT CCGAAGTGGC AAAGGAAGTT GCGAAGAAGA AGGATGCTGG 7680 
189 CTTCGAGACA AGTATAACAG ACTACTGGGG CCGAGCATTG AGTTACCAAC TGATCGGAGC 7740 
190 CGATATGGGC GGCCCGGCTT ACACCTTCTC CAGCATTGCC CAGACCGACA ACTTCCAGAA 7800 
191 GGCCGAAACG CCGTTCCCTA TTCTGGTAGC TGACGGCCGC GCGCCTGGAG ACACCATCAT 7860 
192 CTCCCTCAAT GCTACCAACT ACGAGTTCAA CCCGTTCGAG ACGGGTAGCT GGGACCCGAC 7920 
193 CGTCTATGGC TTTGCGCCGA CCAAGTACCT CGGCGCCAAC TTCAGCAACG GCGTGATCCC 7980 
194 ATCGGGAGGC AAGTGCGTTG AGGGTCTCGA CCAAGCCGGC TTCGTCATGG GCACCAGCAG 8040 
195 CACGCTCTTC AACCAGTTCC TTTTGGCCAA CATCTCCAGC TACGACGGTG TTGCCAGACG 8100 
196 TGCTCATCGA GGCCGTGACT TCTGTCCTCA AGGAAATCGG CGCCAAGAGG ACGACGTCTC 8160 
197 CCAAATCATC CCTAATCCGT TCCTGGACTG GAACAACCGG ACCAACCCCA ACGCCGACAC 8220 
198 GCTCGAGCTC GACCTGGTCG ACGGCGGCGA AGATCTGCAG AATATTCCGC TCAACCCGCT 8280 
199 CACCCAACCC GTGCGCGCCG TCGACGTCAT CTTCGCTGTC GACTCGTCCG CCGACGTGAC 8340 
200 AAACTGGCCC AATGGCACCG CCCTGCGCGC CACCTACGAG CGCACTTTCG GCTCTATTTC 8400 
201 CAACGGGACA CTCTTCCCCT CGATCCCCGA CGACTGGACG TTTATAAACC TAGGCCTCAA 8460 
202 CAACCGCCCC TCTTTCTTCG GCTGCGATGT TAAGAACTTT ACCTTGAACG CCAACCAAAA 8520 
203 GGTTCCCCCC TTAATCGTCT ATGTCCCCAA CGCGCCCTAT ACCGCGCTGA GCAACGTGTC 8580 
204 CACCTTCGAT CCGTCATACA CGATGTCTCA GCGCAACGAC ATCATCGGCA ACGGATGGAA 8640 
205 CTCAGCCACG CAGGGAAACG GCACGCTGGA TTCGGAGTGG CCCACTTGCG TCGCCTGCGC 8700 
206 GGTTATCAGC AGGAGCTTAG ATCGGTTGGG CAGGCAGACG CCAGCCGCGT GCAAGACTTG 8760 
207 CTTTGACAGG TATTGCTGGA ATGGCACAGT GAACTCCAAA GATACGGGGG TTTACATGCC 8820 
208 TGAGTTCAAG ATTGCGGATG CGCATGCCCT GGACTCGGGT GCTGTTGCTA TCGGAAAGAT 8880 
209 GGTGAATGTC TGGTCGTCGG TTGTGGTGGG AGTTGTGGCG GCTACTTTGT TGTTGTAGGG 8940 
210 GTAGGGGAGA CGTGATGATA TTCCAGTCTG ATGAAGTTGA GACTGGACTG GAGATCGCCA 

211 AGGATGCGGA GGGAAAGGAA TGCGTGGTGT TAATGTCATG ATGGATGAAG AGTCATGGAT 

212 CATGGAACGA CGGGGCGGGG ATATTGGATG ATGGATATAC CACACTGCAT GCATGCTCTA 

213 TTGATAGTAT GCTTTGGCAT TTACGTTTAA CAATCAATTG CTCCATCCTG ATGTTCTATC 

214 TTTTTCGACA ATGGATTGAT ACTACTCCTG TTGCTTCGCT CTTGAGGTTG GAAGGACTTG 

215 AGGTTGGAAG GACTTGAGGT TGTTTGTTCT GAGGGAGGTT ATCGAAGTAT CATCTGTGCT 

216 GATGCCGATT GATAGACTGT CCTCTTCTTC GAGGCAACGA ACGGTCGGAT GAGCCTCTTT 

217 AATCATGATG CTCAGTGCCA CAAAAAGGCT CCAGCACAGC TGCCCACACC TTTCTTGCCT 

218 CGCCGTTCCT TCCTTTTTCT TTTCCCCTGT TTCCTTTCTT CCTTTCCATC TCATCCCGTA 

219 CCAGAGTGCC CACCGGGTAT ATATATTACC TCCTTGGCCG TTCTCCTTTG ACCAATAAAT 

220 CGCTTGGTCG AGTGGCGTAA CGGTTTACCG TCTACACTTA TCACTCAAAC CAAACCAAAC 

221 CATCGAAGAA GTGACCTATC GGTTCGAGGG AACGGTGATG TTCTTACGAC CAAGTTAACC 

222 CAAAGAGCGT TCCACATCGT TGAACCGTCT CCTCCAGTTG GATCTGTTTA ACTTCCGCAG 

223 CGACTGAAGA AGGTATCACT TTTTTTTTGG TTCCAAAAAA AAAAAAAAAA ATTAC 
226 (2) INFORMATION FOR SEQ ID NO: 2: 

228 (i) SEQUENCE CHARACTERISTICS: 

229 (A) LENGTH: 9934 base pairs 

230 (B) TYPE: nucleic acid 

231 (C) STRANDEDNESS: single 

232 (D) TOPOLOGY: linear 

234 (ii) MOLECULE TYPE: Genomic DNA 

236 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

238 ACCGGGAATC GTAGCGGGCG CTAAGGCCAA GCCGCGGCAC GGGTCACTGA CCCAATGCAG 

239 CGCATTCCGT CAGCAACTGA AGTGGATGTA CAAGTACATA GTAGTAGATC GCAACTGGAG 

240 ATCACTCGCA CCGTGCCGCA GAACAAGGGC GACGAGCCTC AGGGCAGTTT AGCCTGCCGT 

241 AACAGCACAG ACCATAGCTT ATTTTCACCT GGGCGGGCGG GCGACGGCGG CACTGACATC 

242 GGCAAGGCGG CATCAAGCAA CCCCTCTGTT GCTTGCCAGC TGCCGGCCAA CGTCAGCGGT 

243 ACAAGGAGAA ATCTGGAAGG AAAGACTTCT GGCACCGACA GGATGGCACG CGGGAAAAGT 

244 TCCCAATGCA TGAGATGAGG GGCATTTGCA TTGCCTCCCG TCACCCAGTG CGAACCCCAA 

245 CCCCACCATA GCGTCTGTCG ATACATGGAG CGCGAAGTCG AGAAACCTGT AATTCCTGGT 

246 AACTTTCAGG TACACAGTAC GTACTGATCC TGGTATCAAA CCTTGCCTGC CGAGTTTTCG 

247 ACGGAAAGAG GTGTGAATTG TGAAAGAGTC ATACCAAATC ACCCGATTTT CATAAAGCCC 

248 GAGTCTTTTC TGTACATAAG CGACACTCGA AGCGGGCCTC ATCTTCATAG CCTGATAGCT 

249 TGTAATACTC CATCCTCGTA TCTCACTTGA CCTTGAGTTC AACCCCACGT CAAACTTCAC 

250 CCGACACATC GACGGATTGG GGAACAGCAC AATACCTGAA AAGCGAGAAA ACCAAACAGA 

251 GGAAAACACC ATGGAGACAA CACTTCCCCT CCCCTTCCTC GTCGGTGTCA GTGTTCCTCC 

252 CGGACTGAAT GACATCAAGG AGGGCCTCAG CCGGGAGGAA GTCTCGTGTC TTGGCTGCGT 

253 CTTCTTCGAG GTCAAGCCCC AGACCCTTGA GAAAATCCTG CGATTCCTCA AGCGTCACAA 

254 TGTCGAATTT GAGCCCTACT TCGATGTAAC AGCCCTCGAG TCTATCGATG ATATTATCAC 

255 TCTTCTGGAC GCCGGCGCCC GCAAGGTGTT TGTCAAGACC GAGCAGTTGG CCGACCTCTC 

256 CGCATATGGC TCCCGCGTTG CCCCCATTGT CACTGGAAGC AGCGCTGCTT TGCTTTCCTC 

257 CGCCACCGAG AGCGGCCTTT TGCTCTCCGG CTTCGATCAG ACTGCCTCCG AGGCTGCACA 

258 GTTTCTGGAG GAGGCCAGAG ACAAGAAAAT TACCCCCTTC TTCATCAAGC CCGTTCCTGG 

259 GGCCGATCTC GAACAGTTCA TCCAGGTCGC CGCCAAGGCT AACGCCATCC CCATCCTGCC 

260 ATCCACTGGC TTGACAACAA AGAAGGACGA GGCCGGCAAG CTTGCCATCT CCACCATCCT 

261 CTCGAGCGTC TGGAAGTCTG ACCGTCCCGA TGGTCTTCTC CCCACCGTTG TCGTTGATGA 

262 GCACGACACT GCTCTGGGTC TGGTCTACAG CAGTGCCGAG AGTGTGAACG AGGCCCTCAG 

263 GACACAGACT GGTGTCTATC AGAGCCGGAA GCGCGGTCTC TGGTACAAGG GTGCTACTTC 

264 CGGAGACACT CAGGAGCTCG TCCGCATCTC GCTTGACTGC GATAACGATG CTCTCAAGTT 
VERIFICATION SUMMARY DATE: 01/24/2002 

PATENT APPLICATION: US/09/751,962 TIME: 18:46:53 



Input Set : N:\Crf3\RULE60\09751962.raw 
Output Set: N:\CRF3\01242002\I751962.raw 

L:27 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:28 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:411 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 2 






